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ABSTRACT 


Standard  axiomatizations  of  expec ted - ut i 1 i ty  theory 
envision  an  agent  with  fixed  probability  assessments,  who  can  be 
observed  to  choose  actions  from  varying  opportunity  sets  (for 
instance,  pairs  of  lotteries).  These  axiomatizations  also 
envision  that  the  agent's  preferences  among  these  actions  depend 
on  the  state  of  nature  only  through  the  s tate - dependent 


clea 

may 

and 

acti 

maxi 

util 

patt 

disc 


equences  of  the  actions,  and  that  these  consequences  are 
rly  defined  and  observable.  We  sugge-st  that  this  conception 
be  an  unnecessarily  restrictive  basis  for  empirical  testing, 
instead  study  the  pattern  of  choices  from  a  fixed  set  of 
ons  as  probability  assessments  change.  We  show  that 
mization  of  the  expectation  of  a  general,  s tate - dependent 
ity  function  places  nontrivial  restrictions  on  such  a  choice 
ern.  These  restrictions  are  completely  characterized  by  a 
rete  version  of  an  integrability  condition.  ^ 


1 .  Introduc  t i on 


There  has  been  long-standing  interest  in  testing  expected- 
utility  theory  against  alternative  theories  of  choice  in  risky  or 
uncertain  situations.  Until  recently,  the  only  direct  tests  were 
based  on  experimentally  generated  data.  The  results  of  many  of 
these  laboratory  experiments  were  unfavorable  to  the  expected- 
utility  hypothesis.^  Within  the  past  decade,  though,  a 
substantial  body  of  non- experimental  evidence  has  been  shown  to 
conform  well  to  the  expected-utility  hypothesis.  This  data 
concerns  a  variety  of  dynamic,  stochastic  xhoice  problems  related 
to  optimal  stopping.  (These  studies  include  Miller  (1984),  Pakes 
(1986),  Rust  (1987)  andWolpin  (1984).) 

These  conflicting  results  could  be  reconciled  in  several 
different  ways.  It  is  possible  that  optimal - stopping  problems 
are  somehow  psychologically  special,  and  that  people's 
performance  with  respect  to  these  problems  corresponds  more 
closely  to  expec ted - ut i 1 i ty  maximization  than  does  their  general 
performance.  It  is  also  possible,  though,  that  the  experimental 
evidence  systematically  under-represents  people's  conformity  to 
expected-utility  theory.  This  unde r - repres entat ion  may  be 
especially  severe  with  respect  to  decision  making  by  experts. 
Expected-utility  maximization  with  respect  to  an  area  of 
substantive  expertise  is  a  learned,  non-verbal,  domain- specif ic 


A  particularly  careful  experiment  of  this  sort  was  done  by 
Grether  and  Plott  (1979).  References  to  other  experiments  are 
given  in  the  bibliography  of  Allais  (1979). 
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skill.  Such  skill  may  have  to  be  acquired  slowly  and  through 

intensive  study  ( for  instance,  over  a  period  of  several  years  of 

apprenticeship  or  postgraduate  professional  education),  and  it 

may  not  transfer  instantaneously  to  an  artificial  task  in  an 

experiment ,  even  after  it  has  been  acquired.  These 

considerations  show  the  great  importance  of  further  examining  the 

expected-utility  hypothesis  outside  of  the  laboratory,  and 

particularly  with  respect  to  a  variety  of  decision  situations 

3 

faced  by  experts.  In  this  paper,  we  provide  a  characterization 
of  expec ted - ut il i ty  maximization  that  may  be  useful  for  such 
investigations . 

This  characterization  differs  from  the  usual  ones  (such  as 
Savage  (1954))  in  two  respects  that  we  believe  will  facilitate 
its  empirical  application.  First,  while  the  usual 

characterizations  assume  a  fixed  probability  distribution  of  the 


The  non-verbal  aspect  of  expected-utility  maximization  was 
particularly  emphasized  by  Savage  (1954).  If  experimental 
protocols  require  subjects  to  rely  to  a  large  extent  on  their 
linguistic  capabilities,  then  the  experiments  may  not  be 
representative  of  other  decision-making  problems.  An  observation 
bearing  on  this  point  has  been  made  by  Shortliffe  and  Buchanan 
(1984,  p.  236),  who  note  the  difficulty  of  obtaining  verbal 
probability  estimates  of  diagnoses  fror  ex-erienced  physicians. 
This  observation  need  not  entail  that  physicians'  treatment 
decisions  are  inconsistent  with  expected-utility  maximization. 
The  physicians  may  be  skilled  at  making  treatment  decisions,  but 
yet  have  little  facility  at  verbalizing  the  grounds  for  those 
dec  is  ions . 

3 

The  success  of  applied  economic  theories  that  incorporate 
the  expec ted - ut i 1 i ty  hypothesis  has  sometimes  been  claimed  to 
corroborate  the  hypothesis.  However,  Machina  (1982)  and  Chew, 
Epstein  and  Zilcha  (1986)  show  that  comparative-statics 
predictions  cannot  discriminate  between  expected  utility  and  a 
class  of  alternative  decision  criteria. 
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possible  states  of  nature  and  allow  the  set  of  feasible  actions 

to  vary  (typically,  over  pairs  of  alternative  lotteries),  here 

the  set  of  actions  will  remain  fixed  and  the  probability  measure 

will  be  allowed  to  vary.  Second,  while  the  usual 

characterizations  stipulate  that  the  utility  of  taking  an  action 

in  a  given  state  of  nature  depends  only  on  a  specified 

consequence  of  the  action  in  that  situation,  here  utility  will  be 

4 

defined  directly  in  terms  of  the  action  and  the  state  of  nature. 

To  see  why  the  probability  measure  -needs  to  be  allowed  to 
vary,  consider  the  problem  of  deciding  whether  an  investor's 
demand  for  financial  assets  is  consistent  with  expected-utility 
maximization.  If  the  joint  distribution  of  asset  returns  is 
fixed,  then  this  question  can  be  decided.  (Cf.  Green,  Lau  and 
Polemarchakis  (1978),  Green  and  Srivastava  (1986)  and  Varian 
(1983).)  Changes  in  asset  demand  would  be  imputed  to  variation 
of  the  investor's  budget  set  as  asset  prices  change.  However, 
actual  changes  in  asset  prices  are  thought  to  be  largely  a 
consequence  of  changes  in  investors'  perceptions  of  their  joint 
return  distributions.  Therefore,  demand  theory  based  on  the 
usual  characterization  of  expected  utility  cannot  be  applied 
straigtforwardly . ^  In  contrast,  the  empirical  work  on  dynamic 

4 

In  order  to  avoid  reference  to  consequences,  the  utility 
function  must  be  made  state  c ont ingent . S tate - cont ingent  expected 
utility  has  previously  been  axiomatized,  in  a  fromework  that 
otherwise  resembles  Savage  (1954),  by  Kami  et  al.  (1983). 

^Attanasio  and  Weber  (1987)  and  Epstein  and  Zin  (1987)  have 
recently  considered  the  precise  role  of  expected-utility 
maximization  in  the  determination  of  equilibrium  asset  prices 
under  a  rational  -  expectations  assumption. 
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expected-utility  maximization  has  been  successful  precisely 
because  Bayes'  theorem  makes  it  possible  to  estimate  how  the 
agent's  probability  assessments  evolve. 

To  see  why  avoidance  of  reference  to  consequences  of  actions 
may  facilitate  empirical  work,  consider  an  example  of  decision 
making  in  medicine.  A  state  of  nature  specifies  the  situation 
of  the  patient,  including  the  actual  disease,  state  of  general 
health,  economic  and  family  circumstances,  and  so  forth.  An 
action  is  a  medical  or  surgical  treatment.  A  consequence 
specifies  all  of  the  aspects  of  the  doctor -patient  relationship 
and  of  the  patient's  prognosis  about  which  the  doctor  might  be 
concerned,  and  that  a  treatment  might  affect.  Given  a  patient's 
symptoms  and  medical  history,  the  range  of  treatments  that  might 
reasonably  be  prescribed  will  usually  be  quite  small.  However, 
the  consequence  of  a  treatment  is  likely  to  be  complex  and 
difficult  to  measure  preciselv;  is  the  progress  of  the  disease 
arrested,  is  the  patient  well  enough  to  resume  normal  activities , 
does  the  patient  suffer  pain  or  undesirable  side  effects,  how 
high  is  the  cost  of  treatment  to  the  patient,  how  high  is  the 
doctor's  remuneration,  does  the  accomplishment  of  a  state-of- 
the-art  surgical  procedure  enhance  the  doctor's  prestige  in  the 
medical  community,  and  so  forth.  In  this  situation  where 
observations  about  a  narrow  range  of  observed  choices  among  acts 
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unambiguously  to  a  utility  function  defined  on  consequences  are 
poor. 

This  example  of  medical  decisions  provides  a  good  example  of 
how  the  present  theory  might  be  applied.  Imagine  a  doctor  who 
must  base  each  treatment  decision  on  evidence  from  a  medical 
history  and  an  examination  of  the  patient.  Suppose  that,  after 
the  treatment  is  done,  a  completely  accurate  diagnosis  becomes 
available.  If  the  doctor  treats  many  patients,  then  the 

conditional  relative  frequencies  of  these  diagnoses  should 
associate  a  probability  measure  on  states  of  nature  to  each 
medical  history  and  examination  outcome.  For  each  of  these 
probability  measures,  the  doctor  has  a  preferred  treatment.  That 
is,  the  doctor's  choices  can  be  represented  by  an  empirical 
behavior  pattern  that  assigns  actions  to  probability  measures 
over  states  of  nature. 

Behavior  patterns  of  this  form  are  analyzed  in  this  paper. 
It  is  shown  that  the  hypothesis  of  expected-utility  maximization 
places  theoretical  restrictions  on  the  form  of  the  behavior 
pattern.  Whether  or  not  a  behavior  pattern  satisfies  these 
restrictions  can  be  determined  by  solving  a  system  of  linear 
inequalities.  If  the  behavior  is  consistent  with  the  expected- 
utility  hypothesis,  then  the  system  has  a  solution  that  can  be 
used  to  construct  a  utility  function  that  rationalizes  the 
behavior.  These  results  bear  some  formal  similarity  to  the 
standard  problem  of  revealed  preference  in  demand  theory,  and 
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they  could  In  fact  be  formulated  within  a  generalized  version  of 
that  theory  such  as  Richter  (1979)  has  provided.*’ 

2.  Choice  o_f  actions  under  uncertainty 

The  model  to  be  studied  here  is  a  familiar  one  in  most 


respects.  There  is  a  set  of  acts,  and  a  set  of  states  of  nature. 
For  mathematical  simplicity,  it  will  be  assumed  that  both  of 

these  sets  are  finite.  Let  A  be  the  acts,  and  S-(s^ . sn)  be 

the  states.  It  is  assumed  that  the  subjective  probability 

assessments  of  the  subject  may  be  determined  by  any  probability 
measure  over  S  corresponding  to  an  element  of  a  convex  set  M£Hn 


having  nonempty  interior.  If  peM,  then  the  probability  measure 
corresponding  to  n  assigns  probability  p  ^  to  for  jsl,  and 

probability  to  s^  .  7  It  will  be  convenient  to  define 

f  :  M-»  [  0 , 1  ]  by  f  ( p ) -1  -  •  Then,  if  f:S-*Jt,  the  expectation  of  f 

with  respect  to  /i  can  be  defined  by 

(1)  E#i[f]-r(/i)f(s°)+2jilMjf(sj). 

A  behavior  pattern  is  a  mapping  £:M-*A.  A  utility  function 
is  a  mapping  u:AxS-»R.  For  a«A,  define  ua:S->X  by  ua  ( s ) -u  ( a  ,  s  )  . 


Utility  function  u  rationalizes  behavior  pattern  /3  if 
(2)  Vp«M  E  (u^^l-aax  KE  [ua],  and 


Another  link  between  expected-utility  theory  and  revealed- 
preference  theory  is  provided  by  Border  (1987),  who  investigates 
choice  among  lotteries.  He  shows  that  choices  can  be  < 

rationalized  by  expected  utility  if  they  respect  stochastic 
dominance,  provided  that  the  set  of  available  lotteries  is  closed 
under  mixtures.  Because  the  utilities  of  actions  in  various  * 

states  are  not  assumed  in  this  paper  to  be  determined  by  monetary 
payoffs  from  the  actions,  these  actions  cannot  be  ranked  by 
stochastic  dominance. 

7Clearly  M  must  be  a  subset  of  {p|  Vj  p . >0  and  En  .  1) . 
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(3)  V  a  e  $  ( M )  3M«M  (j8(/i)-a  and  VaVa  E^ [ u3 ’ ] <E^ [ u3 ] } . 8 

Note  that,  if  ^  is  a  permutation  of  A,  and  if  p '  -<$  •  p  and  u 
rationalizes  p,  then  there  is  a  utility  function  i  '  that 

rationalizes  P':  u  '  (  4>  (  a )  ,  s  ) -u  (  a  ,  s  )  .  Therefore,  whether  or  not  p 

is  r a t i ona 1 i z ab le  depends  only  on  the  partition  {  p  ^(a)|a£A). 
Thus  (2)  and  (3)  may  be  restated  in  terms  of  partitions.  For  II  a 
finite  partition  of  M,  define  an  indirect  utility  function  for  II 
to  be  any  function  v:IIxS-»R.  In  particular,  if  P  is  a  behavior 
pattern,  u  is  a  utility  function,  and  H-l/?  ^(a)|a£A),  then 
v(/?  ^  (  a )  )  ,  s  ) -u  (  a  ,  s  )  is  an  indirect  utility  function  for  II.  With 

these  definitions  of  II  and  v,  and  defining  v*  ( s  ) -v  ( n  ,  z  )  for  well, 

(2)  and  (3)  translate  respectively  as 

(4)  Vjrffll  V/ifTr  E  [v*]-max  ,  _E  [ v*  J,  and 

H  ‘  it’  e.TL 

(5)  V  jt  e  II  3 /i  «  n  Vw  ’  *7 r  E^  [  v*  '  ]  <E^  [v*]  . 

When  II  is  any  finite  partition  of  M  and  conditions  (4)  and  (5) 
are  satisfied  by  II  and  v,  it  will  be  said  that  v  rationalizes  II 
and  that  II  is  expe  c  te  d  -  ut  i  1  i  ty  consistent. 

If  II  were  the  partition  of  M  induced  by  the  expectation  of 
some  random  variable  X,  then  (4)  would  correspond  to  the 
definition  in  statistical  decision  theory  of  v  to  be  a  proper 
scoring  rule  for  E[XJ.  Although  the  goal  of  the  present  research 
(to  characterize  the  partitions  for  which  (4)  can  be  satisfied) 
is  different  from  the  goal  of  research  on  scoring  rules  (to 

g 

Essentially,  (3)  states  that  the  subject  could  not  be 
assured  to  do  as  well  if  any  action  were  deleted  from  A.  Without 

(3) ,  a  constant  utility  function  would  rationalize  every  behavior 
pattern. 
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characterize  the  indirect  utility  functions  for  which  (4)  can  be 
satisfied  with  respect  to  a  given  partition),  the  two  studies  are 
very  closely  related.  In  particular,  Savage  (1971)  observed  the 
fundamental  role  of  convexity  that  will  be  exploited  below.  In 
the  next  section,  Lemma  3  resembles  the  principal  result  of  , 

Osband  and  Reichelstein  (1985).  Osband  (1986)  has  pointed  out 
that  (4)  and  (5)  imply  condition  (9)  of  this  paper,  the  convexity 
of  cl  (ir)  . 

3.  A  necessary  and  sufficient  condition 

The  main  question  to  be  answered  in  this  paper  is:  when  can 
a  finite  partition  of  M  be  rationalized  by  an  indirect  utility 
function?  A  preliminary  answer  is  given  in  this  section.  This 
answer  will  lead  to  a  necessary  and  sufficient  condition  that  can 
be  formulated  in  terms  of  simultaneous  satisfiability,  in  the 
strictly  positive  orthant  of  a  Euclidean  space,  of  a  finite  set 
of  linear  equations.  This  algebraic  formulation  plausibly  will 
lead  to  the  formulation  of  an  efficient  algorithm  to  decide  the 
que s  t ion . 

The  idea  of  the  theorem  is  given  by  the  following  heuristic 
argument.  Suppose  that  v  is  an  indirect  utility  function  for  n, 
a  finite  partition  of  M.  Define  V:M-»K  by 

(6)  V(M)-max?r ,  [  v*  ]. 

ft  *  * 

Since  E^(v  ]  is  an  affine  function  of  n  for  each  ir '  ,  by  (1)  .  V 

is  a  polyhedral  convex  function.  (That  is,  V  is  both  convex  and  t 

piecewise  linear.  It  is  convex  because  it  is  the  pointwise 


supremum  of  a  set  of  affine  functions,  and  piecewise  linear 
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because  the  set  of  affine  functions  is  finite.)  If  (5)  holds, 


then,  by 

the  continuity  of 

r  *  '  i 

[v  ] 

for  each  n ' and 

by 

the 

finiteness 

9 

of  n,  Vfl-ell  int(jr)*0. 

Le  t 

ftc  Int  (  n)  .  Ther 

if 

(4) 

holds,  the 

gradient  of  V  at  n  is 

(7) 

W(/i)-(v(7r,s1)-v(jr,s0)  ,  . 

. . , v(x 

n.  .  0 .  .  10 

,  s  ) -v(x , s  ) ) . 

This  suggests  that  the  interiors  of  the  elements  of  II  are  the 
largest  sets  on  which  the  gradient  of  V  is  defined  and  constant. 
Moreover,  given  any  polyhedral  function  V:M-*R  having  a  gradient 
that  is  related  to  II  in  this  way,  (>)  should  determine  an 
indirect  utility  function  v  that  satisfies  (6). 

This  conjecture  is  true,  and  is  stated  below  as  a  theorem. 
To  facilitate  the  statement  of  the  theorem  and  later  results,  the 
conditions  that  have  just  been  described  will  be  characterized 
more  formally.  Suppose  that 

(8)  A  is  a  finite  cover  of  M  by  polyhedral  convex 

bodies  In  M .  ^ 

Furthermore,  suppose  that  A  is  virtually  a  partition  of  M,  except 
that  the  boundaries  of  its  elements  may  overlap.  That  is, 
suppose  that 

(9)  V<J«A  Vtf'eA  [If  Sf'S',  then  int(S)  n  Int(*")  -  0.] 

9 

Topological  interior,  closure,  and  boundary  are  taken 
re lat ive  to  M . 

^VV  denotes  the  gradient  of  V.  Cf.  Rockafellar  (1970),  p. 

241  . 

^That  is,  (a)  A  is  a  finite  collection  of  subsets  of  M,  (b) 
M  is  the  union  of  this  collection,  (c)  each  of  the  subsets  is  the 
intersection  of  M  with  a  finite  set  of  closed  half-spaces,  and 
(d)  each  subset  has  nonempty  interior  in  M. 
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If  (8)  and  (9)  are  satisfied,  then  call  A  a  polyhedral 
decomposition  of  M.  From  the  discussion  above,  it  should  be 

clear  that  a  polyhedral  convex  function  induces  a  polyhedral 
decomposition  of  its  domain.  Specifically,  the  function  V 
induces  the  polyhedral  decomposition  A  defined  by 
(10)  6t A  iff  3x<*n  [5  -  cl( (m|VV(m)-x) ] . 12 
A  polyhedral  decomposition  that  satisfies  (10)  for  some 
polyhedral  convex  function  V  will  be  called  a  subgradient 
decomposition.  Now,  the  conjecture  -stated  above  can  be 

reformulated  as  the  assertion  that  n  is  expected-utility 
consistent  if  and  only  if  { cl  ( * )  J  w « n }  is  a  subgradient 

decomposition.  This  assertion  is  now  proved  via  several  lemmas. 
Lemma  1 :  If  v  rationalizes  II,  then 

irCcl{/i|  Vir'»*ir  E^[v*  ]<E^[v*]}. 

7T  9  ic 

Proof:  Let  p'en,  and  let  n  satisfy  E^[v  ]<E^[v  ].  p 


exists  by  (5).  Define  n  -a/i  +  ( 1  -a )  fi '  ,  where  ae(0,l).  Then  /i" 

ic  *  ic 

satisfies  Vw'»<jr  E^„[v  ]<E^„[v  ].  The  lemma  follows  by  letting  a 

approach  0.  Q.E.D. 

Lemma  2  :  If  v  rationalizes  II,  then 
int(ir)-(p|  r  E^Jv*  ]<E^[v*J). 

“IT  ^  ^ 

Proof:  { /jt  |  r  E  [v  ]<E  [v  ])  is  open  because  II  is 

ic  * 

finite  and  because  E^[v  ]  is  continuous  in  /x  for  every  w '  ,  so  by 


I 


t 


12 

Condition  (10)  means  that  int(jr)  is  the  subset  of  M  on 
which  7 V(fi)  is  the  unique  subgradient  of  V.  Recall  that  y  is  a 
subgradient  of  the  convex  function  f  at  x  if,  for  all  x', 
f (x ' )>f (x)+y (x' -x) .  This  inequality  is  called  Fenchel's 

inequality.  It  holds  with  strict  inequality  if  y  is  not  a 
subgradient  of  f  at  both  x  and  x'  . 


i 
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“K  '  If 

Lemma  1,  int  ( it )  Q  { n  |  E^[v  ]<E^[v  ]}.  The  inclusion 

{p|  Vtr'^Tr  E^[v  ]<E^[v  ]  }Cint(w)  follows  from  (4)  because 
{/j|  V-k'^tc  E^[vff  ]<E^[v*]}  is  open.  Q.E.D. 

Lemma  3  :  If  v  rationalizes  II,  if  V  is  defined  by  (6),  and  if 

/ieint()r)  ,  then  VV(/j)  is  defined  and  satisfies  (7).  Suppose  also 
that  /i  '  «  int  ( ?r '  )  .  Then 

(11)  E  [v*‘  ]-v(*'  ,s°)+VV(p' )M. 

Proof:  By  (4)  and  (6),  V  coincides  with  an  affine  function 

on  a  neighborhood  of  ft.  Thus  VV(p)  is  defined,  and  (7)  is 
derived  by  computation  using  (1).  The  second  assertion  follows 
directly  from  (4),  (6),  and  (7).  Q.E.D. 

Lemma  4:  Suppose  that  v  rationalizes  II ,  and  that  V  is  defined  by 

(6).  If  ir*ir',  pe  int(ir),  and  ft' e  int(jr'),  then  VV  (ft)  »*VV(/i  '  )  . 

Proof:  By  Lemma  2  and  (11), 

0<[v(?r,  s°) -v(»r'  ,  s°))  ]  +  [VV(/0 -VV(^'  )  ]  ft .  Interchanging  ft  and  ft', 
and  it  and  it'  ,  and  adding  the  resulting  equation  to  the  original 
one,  yields  0<  [  VV  (p  )  -  VV  (p  '  )  }  (fi  -  fi '  )  .  This  requires  that 

[7 V(p) -7V(m' ) 1*0 .  Q.E.D. 

Lemma  5 :  Suppose  that  v  rationalizes  n  and  that  V  is  defined  by 

(6).  Then  V  is  convex.  If  /i«int(ir),  then  int  ( it  )“{fi'  1 7V  (ft )  is 
the  unique  subgradient  of  V  at  /*'}. 

Proof:  Convexity  of  V  is  immediate  from  (6).  Furthermore, 
int  ( it  )£  ( fi  '  |  VV  (ft )  is  the  unique  subgradient  of  V  at  fi'),  by  Lemma 
3.  Now  suppose  that  { in  t  ( x )  .  By  Lemmas  1  and  2,  there  Is  a 
sequence  with  {  n C )  CU^  ,  ^  int  ( it '  )  .  By  the  finiteness  of  II 

there  is  some  n'r*n  such  that  an  infinite  subsequence  of  (pC)  is 
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in  int(ir').  Let  /i'  tint  (*').  By  (7),  7V  (/i C  ) -7V  ( /i '  )  along  this 
subsequence.  Therefore,  both  7V(j*)  and  VV(^')  are  subgradients 
of  V  at  /i",  by  Theorem  24.4  of  Rockafellar  (1970).  This  shows 
that  lp'jW(/0  is  the  unique  subgradient  of  V  at  /*' )Cint(ir)  . 
Q.E.D. 


Lemma  6  :  Suppose  that  V:M-*R  is  a  polyhedral  convex  function,  and 

that  II  and  V  satisfy  (8)  and  (10).  Then  there  is  an  indirect 
utility  function  v  that  rationalizes  II. 

Proof:  Let  ir«n.  By  (8),  int(w)  is  nonempty.  Let  p«int(7r). 

In  order  to  satisfy  (11)  and  (7),  respectively,  define 
v  (  tc  ,  ) -V  (/i)  -  7V  (p)  n  ,  and  define  v(ir  ,  s-^  )- [  7V  (p  )  ]  .  +v  (w  ,  s^  )  .  Then, 


if 


H  '  t  i  n  t  (  n  '  )  , 


£ ^ [ v  ]-f (/*)  [ V(M'  )  -7V(m'  )P'  ]  + 


Sj>lMj  (  fVV(>1'  )  1  j +  [*(/»'  )-7V(M'  )M'  1  )-V(m'  )+7V(m*  )  in- li’  )  •  Thus 

E  [v*']  -*V  (/i)  ,  and  E  [v*  ]<V(#x)  if  n'f*ir  by  Fenchel's  inequality  and 
1 3 

(10).  This  establishes  (5).  (4)  is  established  by  a  limit 

argument,  using  (8)  and  the  continuity  of  V.  Q.E.D. 

Theorem  1  :  II  is  expected-utility  consistent  if  and  only  if 

{  cl  (* )  |  irell }  is  a  subgradient  decomposition. 

The  proof  of  Theorem  1  is  immediate  from  the  lemmas. 

4.  Some  partitions  that  are  not  exoe c t e d - u t i 1 i tv  consistent 

The  requirement  that  an  expec ted - u t i 1 i ty - cons i s tent 
partition  should  essentially  coincide  with  a  polyhedral 
decomposition  is  intuitive  and  geometrically  concrete.  Convexity 
is  the  key  economic  aspect;  it  means  that  if  the  same  act  is 


l  3 

Fenchel’s  inequality  is  the  statement  that,  for  a  convex 
function  f,  f ( x ' ) > f ( x ) +y ( x ' - x ) ,  with  strict  inequality  if  y  is 
not  a  subgradient  of  f  at  x' . 


1  2 

preferred  ac  both  p  and  p  ,  then  that  act  will  be  preferred  for 

1  2 

any  random  draw  between  p  and  p  . 

The  additional  requirement  (10)  imposed  on  a  subgradient 
decomposition  is  less  intuitive  and  more  abstract.  This  section 
attempts  to  foster  some  intuition  for  the  role  of  (10)  by 
presenting  two  polyhedral  decompositions  that  fail  to  be 
subgradient  decompositions,  and  by  showing  that  these  cannot  be 
obtained  from  expec ted - ut i 1 i ty - cons i s tent  partitions.  Both  of 
these  examples  assume  three  states  of  nature  (n-2),  allowing  M  to 
be  graphed  (by  making  an  affine  transformation)  as  a  neighborhood 
of  the  origin  in  the  plane. 

Example  1 :  The  first  example  is  graphed  in  Figure  1.  It 

consists  of  three  polyhedral  convex  sets.  5^  is  the  intersection 

2  3 

of  M  with  the  lower  half  plane,  and  6  and  6  are  the 
intersections  of  M  with  the  right  and  left  quadrants  of  the  upper 
half  plane,  respectively. 

Suppose  that  these  three  sets  were  the  images  of  the 
intersections  of  M  with  the  parts  of  the  probability  simplex 
where  acts  1,  2,  and  3  were  weakly  preferred  by  an  expected- 

util  i  ty  -  maximiz  ing  agent.  If  act  3  were  stricken  from  the  choice 

3  12 

set,  then  6  would  have  to  be  "divided  up"  between  S  and  6  so 

as  to  leave  a  polyhedral  decomposition.  Clearly,  the  only  way  to 

2 

do  this  would  be  to  extend  6  to  cover  the  entire  upper  half  of 
the  plane.  It  follows  that  the  agent  would  be  indifferent 
between  act  1  and  act  2  along  the  same  border  that  was  originally 
the  line  of  indifference  between  act  2  and  act  3.  By 
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transitivity,  the  agent  would  have  to  be  indifferent  between  act 
1  and  act  2  along  that  border.  But  this  cannot  be,  since  that 
border  is  not  contained  in  5^.  Therefore  the  decomposition  is 
not  induced  by  an  expec ted- ut il i ty - cons i s tent  partition. 

Example  2:  The  second  example  is  shown  in  Figure  2.  It  is 
constructed  from  two  concentric  squares.  Let  the  vertexes  of  the 
smaller  square  have  coordinates  (±1,±1),  and  let  the  vertexes  of 
the  larger  square  have  coordinates  (±3, ±3).  Connect  (1,1)  to 


(3,2)  , 

(-1,1) 

to  (-2,3),  (-1,-1) 

to  (-3,-2) 

,  and 

(1,-D  to 

(2,-3). 

This 

construction  divides 

the  larger 

square 

into  five 

polyhedral  convex  sets.  Let  the  set  to  the  right  of  the  smaller 
square  be  6® ,  and  number  the  other  sets  outside  the  smaller 
square  in  counterclockwise  order.  Let  the  smaller  square  itself 
be  &**  . 

This  polyhedral  decomposition  can  be  shown  not  to  be  induced 

by  any  expected-utility-consistent  partition  by  a  similar  method 

as  above.  If  act  4  were  eliminated  from  the  choice  set  and  if 

4 

the  agent  were  an  expected  utility  maximizer,  then  S  would  have 
to  be  divided  up  among  the  other  four  regions  so  as  to  leave  a 
polyhedral  decomposition.  But  in  fact  even  enlarging  5^  through 

4 

6  as  much  as  convexity  will  allow  leaves  a  square  "hole"  around 
the  origin,  as  Figure  3  shows.  Hence  the  decomposition  cannot  be 
induced  by  an  expec ted - ut il i ty - c ons i s tent  partition.  In  economic 
terms,  there  is  a  circularity  of  preferences  in  the  interior  of 
the  hole:  Act  0  is  strictly  preferred  to  act  1,  act  1  is  strictly 
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preferred  to  act  2,  act  2  is  strictly  preferred  to  act  3,  and  act 
3  is  strictly  preferred  to  act  0. 

Incidentally,  this  example  shows  that  being  a  subgradient 
decomposition  is  not  always  stable  under  small  perturbations. 
Consider  the  5-element  decomposition  formed  by  connecting  each 
vertex  of  the  smaller  square  to  the  corresponding  vertex  of  the 
larger  square.  Thxs  is  the  subgradient  decomposition  induced  by 
the  function  V  defined  by  V(x,y)-0  if  max(|x|,|y|)  1,  and 
V  (x  ,  y  ) -max  (  |  x  |  ,  j  y  |  )  - 1  otherwise.  Clearly,-  though,  an  arbitrarily 
small  rotation  of  each  of  the  diagonal  edges  of  this 
decomposition  will  transform  it  to  one  that  is  qualitatively  like 
the  decomposition  of  the  example.  It  is  conceivable  that  this 
sort  of  instability  might  lead  to  inappropriate  rejection  of  the 
hypothesis  of  expected-utility  maximization  due  to  measurement 
error. 

5.  Characterization  of  subgradient  decompositions 

The  foregoing  examples  provide  some  intuition  for  why  only 
subgradient  decompositions,  rather  than  all  polyhedral 
decompositions,  can  arise  from  expected-utility  maximization. 
However,  the  examples  do  not  directly  provide  a  concrete  picture 
of  what  subgradient  decompositions  look  like  or  a  computational 
procedure  for  recognizing  them.  The  problem  is  that  condition 
(10)  requires  the  existence  of  an  unspecified  polyhedral  convex 
function.  For  any  given  polyhedral  decomposition,  we  would  like 
to  be  able  to  recognize  whether  a  function  satisfying  the 
condition  exists,  and  to  be  able  to  construct  it  if  it  does 
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exist.  Further  analysis  of  the  examples  in  this  section  will 
lead  to  such  a  constructive  characterization. 

Two  observations  are  in  order.  First,  both  of  the  examples 
have  been  studied  in  the  same  way:  by  removing  elements  of  the 
polyhedral  decomposition,  trying  to  enlarge  the  remaining 
elements  to  fill  the  gaps  without  violating  the  convexity 
restrictions  imposed  by  expected-utility  maximization,  and 
finally  checking  for  violations  of  transitivity.  It  is  plausible 
that  this  strategy  could  be  general ize-d ,  and  that  it  would 
provide  a  characterization  of  the  sort  that  we  are  seeking. 

However,  we  will  derive  a  somewhat  different  characterization 
below . 

Second,  in  the  examples  the  impossibility  of  satisfying  (10) 

has  been  quite  apparent,  but  in  general  this  issue  will  be  more 

subtle.  A  third  example,  closely  related  to  Example  1,  will  make 

this  point  clear.  To  motivate  this  example,  consider  how  it  can 

be  shown  by  contradiction  that  Example  1  does  not  satisfy  (10). 

Imagine  that  (10)  held,  and  think  of  Figure  1  as  a  view  from 

above  of  the  polyhedral  convex  function  V.  To  satisfy  (10)  all 

of  the  flats  of  this  graph  must  have  different  slopes.  The  flats 
2  3 

lying  above  8  and  S  are  obtained  by  creasing  the  graph  above 

2  3 

the  boundary  of  61,  so  that  the  flats  lying  above  8  and  8 

become  "flaps"  attached  to  the  flat  lying  above  5^.  In  order  for 

those  flaps  to  have  different  slopes,  though,  the  surface  of  the  I 

2  3 

graph  must  be  "torn"  above  the  boundary  between  8  and  8  .  This 

cannot  happen,  because  it  would  imply  that  V  were  discontinuous. 
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Now,  consider  a  more  subtle  example  for  which  continuity 

considerations  do  not  suffice  to  show  the  impossibility  of 

satisfying  (10) . 

Example  3 :  This  example  is  graphed  in  Figure  4.  In  place  of  the 
boundary  between  the  two  subsets  of  the  upper  half  plane  in 
Example  1,  there  is  now  an  additional,  wedge-shaped  subset.  The 
graph  of  V  can  be  adjusted  above  this  wedge  to  repair  the 
discontinuity  in  the  earlier  example.  However,  the  resulting 
function  will  not  be  convex. 

To  present  this  example  algebraically,  define 

51-{(x,y)Jy  0),  *2-{(x,y))0  y  x),  $3-((x,y)}0  y  -x) ,  and 

4  i 

6  - ( ( x , y ) | y>max [ x , - x ] J .  These  four  sets  comprise  a  polyhedral 

decomposition  A.  The  interiors  of  the  elements  of  A  are  Che 

regions  on  which  the  gradients  of  a  p iecewise - 1 inear  function  V 

is  defined  and  constant.  Specifically,  let  V(x,y)-0  on  S * , 

2  3  4 

V(x,y)-y  on  6  ,  V(x,y)--y  on  S  ,  and  V(x,y)-x  on  5  .  However  V 

is  not  convex,  as  is  easily  seen  by  considering  V(x,l)  as  a 

function  of  x. 

Of  course,  the  fact  that  a  polyhedral  decomposition  A  is 

induced  by  a  non-convex  function  does  not  mean  that  it  cannot 

14 

also  be  induced  by  some  other  function  that  Is  convex.  In  that 

case,  A  would  satisfy  (10)  and  be  a  subgradient  decomposition. 

The  same  kind  of  argument  as  was  given  for  Example  1  will  work 

2  4 

here  also,  but  two  of  the  elements  of  A  --  either  6  and  6  ,  or 

14 

We  were  led  to  formulate  Example  3  by  J.-P.  Benoit, who 
raised  the  question  of  whether  (10)  is  more  restrictive  than  it 
would  be  if  only  the  piecewise  linearity  of  V  were  required. 
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3  4 

else  8  and  8  ,  will  have  to  be  deleted  before  trying  to  extend 

the  remaining  subset  of  the  upper  half  plane,  if  transitivity  is 

4 

to  lead  to  a  contradiction.  Moreover  it  is  obvious  that,  if  8 

were  to  be  subdivided  into  m  wedges  radiating  from  the  origin, 

2  3 

then  all  of  these  plus  either  8  or  S  --a  total  of  m+1  elements 

of  the  polyhedral  decomposition  --  would  have  to  be  deleted 

before  transitivity  would  lead  to  a  contradiction.  Thus  the 

strategy  of  identifying  violations  of  (10)  by  making  deletions 

can  become  very  complex,  even  when  there  are  only  three  states  of 

nature.  It  would  be  desirable  to  have  a  criterion  that  is  of 

bounded  complexity.  Such  a  criterion,  involving  only  the 

inspection  of  pairs  of  adjacent  elements  of  A,  is  now  provided. 

Notice  that  in  both  Example  1  and  Example  3,  the  nonnegative 

2 

ray  of  the  x-axis  Is  a  face  of  5  ,  but  is  not  an  entire  face  of 

5^.  It  turns  out  that  such  a  mismatch  of  faces  must  not  occur, 
if  (10)  is  to  hold.  This  no-mismatch  condition  can  be 

reformulated  in  such  a  way  that  a  graph  can  be  associated  with  A 
if  it  is  satisfied,  and  this  graphical  interpretation  will  be 
useful  later. 

To  state  the  condition  in  this  way,  let  5  be  a  polyhedral 
convex  subset  of  MCRn  with  nonempty  interior.  Define  an  edge  of 
8  to  be  the  relative  interior  of  an  ( n - 1 )- dimens ional  face  of  8. 
Define  the  polyhedral  decomposition  A  of  M  to  be  a  polyhedral 
graph  if,  whenever  w  is  an  edge  of  8  and  w'  is  an  edge  of  S'  and 
u nw'f0,  then  w-w'.  Note  that,  according  to  these  definitions, 
the  elements  of  A  are  vertexes  of  a  finite  graph,  and  two  such 
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vertexes  are  adjacent  in  the  graph  if  and  only  if  they  share  an 
(n- 1) -dimensional  face  in  the  decomposition. 

Theorem  2  :  Every  subgradient  decomposition  of  M  is  a  polyhedral 

graph . 

Proof:  Suppose  that  A  is  a  subgradient  decomposition.  By 
(10),  a  function  c:A-*!R  and  a  1-1  function  g  :  A-*Rn  exist  such  that 

(12)  V/ieM  V(/i)-max5eA[c(5)+g($  )m!  ,  and 

(13)  VSeA  S-{p\V(p)-c(S)+g(6)p)  . 

A  contradiction  will  be  derived  from  the-  supposition  that  A  is 
not  a  polyhedral  graph.  Suppose  particularly  that  u  is  an  edge 
of  5,  w '  is  an  edge  of  S',  p'  ewnu'  ,  and  wOw'  is  a 

relatively  open  set  of  a  hyperplane  H.  Since  (13)  holds 
everywhere  on  this  set  with  respect  to  both  6  and  S' ,  c(6)—c(S') 
and  [g(5')-g(5)l  is  perpendicular  to  H.  w\w'  has  nonempty 
relative  interior  in  H,  so  there  must  be  a  third  element  5"  of  A 
having  an  edge  w"  and  a  member  /*"  such  that  p”  eun w"  .  Again,  it 
must  be  that  c(S)-c(5n)  and  [  g ( S " ) - g ( S ) ]  is  perpendicular  to  H. 
That  is,  if  p  is  the  unique  unit  vector  perpendicular  to  H  and 
such  that  p(5-H)C(-«,0]  ,  then  [g(5')-g(6)]  and  [ g ( 5 " ) - g ( 5  )  ]  are 
scalar  multiples  of  p  by  positive  constants  a’  and  a", 
respectively.  Now,  for  some  positive  constant  7,  p ' +7p e int ( 5 ' ) 
and  /i"+7pe  int  (5").  For  p-p' +rp ,  (12)  and  (13)  imply  that  a’Sa", 

and  for  p-p"  +  yp ,  these  equations  imply  that  a"aa' .  That  is, 
a'-a”,  contradicting  g  being  1-1.  Q.E.D. 


Theorem  2  states  that  being  a  polyhedral  graph  is  a 
sary  condition  for  a  polyhedral  decomposition  to  be  a 
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subgradient  decomposition,  which  is  what  is  related  to  expected- 
utility  maximization  via  Theorem  1.  However,  Example  2  shows 
that  being  a  polyhedral  graph  is  not  sufficient  to  be  a 
subgradient  decomposition.  Figure  2  shows  clearly  that  Example  2 
is  a  polyhedral  graph,  but  it  has  already  been  shown  not  to 
satisfy  (10).  Further  consideration  of  Example  2  will  lead  to 
the  formulation  of  an  additional  necessary  condition  for  a 
polyhedral  graph  to  be  a  subgradient  decomposition,  and  it  will 
turn  out  that  this  new  condition  is  sufficient  as  well  as  being 
necessary.  The  proof  of  sufficiency  will  be  constructive. 
Specifically,  it  will  show  that  the  condition  is  equivalent  to 
the  consistency  of  a  finite  system  of  weak  linear  inequalities, 
and  it  will  provide  an  algorithm  for  defining  a  polyhedral  convex 
function  V  satisfying  (10)  if  the  condition  holds. 

We  are  now  going  to  provide  a  different  argument  that 
Example  2  does  not  satisfy  condition  (10).  This  argument  relies 
heavily  on  the  fact  that  Example  2  is  a  polyhedral  graph. ^  For 


The  argument  could  be  reformulated  to  apply  to  any 
polyhedral  decomposition.  To  do  so,  associate  a  finite  graph 
with  the  decomposition  by  stipulating  that  5  and  S'  share  an 

edge,  the  relative  interior  of  5n5’,  whenever  that  set  has 
dimension  n-1.  Then  the  present  argument  and  the  proof  of  the 

characterization  theorem  (Theorem  3)  below  would  remain  sound 
without  any  essential  change,  and  Theorem  2  could  be  derived  from 
Theorem  3  as  a  corollary.  That  approach  would  have  the  advantage 
of  making  as  clear  as  possible  how  fundamental  are  the 
inte gr ab i 1 i ty  considerations  with  which  Theorem  3  is  concerned. 
We  have  proved  Theorem  2  directly  because  it  is  of  some 

independent  interest  to  pursue  as  far  as  possible  the  elementary 
approach  to  characterization  (i.e.,  without  transforming  the 

problem  into  one  regarding  consistency  of  a  system  of  linear 
inequalities)  that  has  been  studied  so  far  in  the  paper.  In 
addition,  the  direct  proof  is  actually  shorter  than  is  the  proof 
from  Theorem  3.  Theorem  2  shows  that  there  is  no  loss  of 
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i  k  4 

any  distinct  6J  and  S  that  share  an  edge,  let  be  a  nonzero 

2 

vector  of  S  satisfying 

(14)  ejk(Sk-sj )C[0,«O  . 

i  k  i  k 

£J  is  called  the  directed  edge  from  5J  to  5  .  Let  E  be  an 

assignment  of  these  vectors,  one  vector  to  each  ordered  pair  of 

1  k 

vertexes  that  share  an  edge.  (That  is,  if  SJ  and  S  are 

i  Ic  k  i 

adjacent,  then  E  will  specify  both  non-zero  vectors  cJ  and  c  J.) 
For  any  polyhedral  graph  A  and  set  £  of  vectors  satisfying  these 

conditions  (where  ECSn  if  M£Rn) ,  call  (A.E-)  a  polyhedral  directed 

Ik  Ik 

graph.  If  5J  and  S  share  an  edge,  then  (14)  determints  cJ  up 

to  a  positive  scalar  multiple,  since  the  shared  edge  has 

■k  1 

dimension  (n-1)  and  6  -  5  J  has  nonempty  interior.  This  fact  and 
Fenchel's  inequality  imply  the  following  two  lemmas. 

Lemma  7  :  Suppose  that  A  is  a  subgradient  decomposition,  and 

specifically  that  the  convex  function  V :  M->R ,  the  function  c  :  A-*B , 
and  the  1-1  function  g:A-*Rn  satisfy  (12)  and  (13).  (That  is  if 
peint(S),  then  VV(p)-g(5).)  If 

(15)  cjk-g(5k)-g(5j), 

i  k 

and  if  E  is  the  set  of  all  of  these  vectors  for  6J  and  6  sharing 
a  common  edge,  then  (A,E)  is  a  polyhedral  directed  graph. 

Lemma  8  :  For  every  polyhedral  graph  A,  at  least  one  polyhedral 


directed  graph  (A,E)  exists.  If  (A,E)  and  (A,F)  are  polyhedral 

i  k  i  k 

directed  graphs  defined  from  A,  having  directed  edges  cJ  and 

1  k 

respectively,  then  there  exist  positive  scalars  aJ  such  that 


(16) 


generality  in  restricting  our  attention  to  polyhedral  graphs. 
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Directed  edges  will  now  be  defined  for  the  polyhedra1  graph 

i  Vc 

A  of  Example  2,  and  it  will  be  shown  that  positive  scalars  aJ 
cannot  be  found  to  satisfy  (15)  and  (16)  for  any  1-1  function 


g : A-*  . 


By  Lemma  7,  '.hen,  A  cannot  be  a  subgradient 


de  comp  o  s i t i on . 

One  set  F  of  directed  edges  satisfying  (14)  consists  of: 
C40-(1,0),  £41-(0,1)  ,  £42“(-l,0)  ,  £43-(C,-l),  and  £01-(-l,2), 

£12“(-2, -1)  ,  C23-(l,-2),  ^30-(2,l)  ,  and  £jk--fkj  for  the 

remaining  elements. 

Suppose  that  a  polyhedral  directed  graph  could  alternatively 

be  induced  on  A  by  the  subgradient  vectors  of  a  polyhedral  convex 

function,  as  envisioned  by  Lemma  7.  That  is,  suppose  that  g :  A-+Rn 

i  k 

is  1-1,  and  that  (15)  and  (16)  define  positi\e  scalars  aJ  (where 
1  k 

the  vectors  cJ  are  as  in  Lemma  8).  A  contradiction  will  be 
derived.  By  (15), 

(17)  0-£40+£°L+£14. 

Applying  (16)  to  (17)  yields 

/hm  n  01,01  14,14 

(18)  0-or  f  +a  £  +a  £ 

The  x-coordinate  of  (18)  is 

(19)  O-c40-*01, 
and  the  y-coordinate  is 

(20)  0-2a01-a14. 

Multiplying  (19)  by  2  and  adding  (20)  yields 

(21)  0-2q40 - q 14 . 

Bv  (15),  £43’--c3'4.  Also  f4^--f34,  so  a4^-a34  by  (16).  Applied 

to  (20)  and  (21),  this  yields 
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(22)  a41-2a4°. 

Similarly  it  can  be  shown  that 

40  -  43  43-42  .  42  0  41 

(23)  a  -2a  ,  a  —2a  ,  and  a  —2a 

40  40 

But  (22)  and  (23)  imply  that  a  -16a  ,  which  is  only  possible  if 

40 

a  -0.  This  would  violate  Lemma  8,  though,  so  a  contradiction  has 
been  reached.  Thus  4  cannot  be  a  subgradient  decomposition, 
although  it  is  a  polyhedral  graph. 

In  the  context  of  a  specific  example,  it  has  just  been  shown 
that  a  subgradient  decomposition  must  satisfy  a  discrete  version 
of  an  integrability  condition.  Integrability  conditions  are 
familiar  from  revealed-pref erence  theory  where,  In  the  presence 
of  subsidiary  technical  conditions,  they  are  sufficient  as  well 
as  necessary  for  behavior  to  be  consistent  with  utility 
maximization.  The  present  integrability  condition  will  now  be 
stated  in  general  terms,  and  will  be  shown  to  be  sufficient  as 
well  as  necessary  for  a  polyhedral  graph  to  be  consistent  (in  the 
sense  of  Theorem  1)  with  expec ted- util i ty  maximization. 

An  elementary  circuit  in  a  directed  graph  is  a  path  from 
some  vertex,  following  edges,  until  the  initial  vertex  has  been 
reached  again.  A  polyhedral  directed  graph  is  a  directed  graph, 
with  the  vertexes  being  the  elements  of  4.  The  polyhedral  graph 
4  shown  in  Figure  2  was  proved  not  to  be  a  subgradient 
decomposition  by  proving  that,  in  any  polyhedral  directed  graph 
(4,E),  the  vector  sum  of  the  directed  edges  of  some  elementary 
circuit  must  be  nonzero.  In  fact,  the  condition  that  E  can  be 


found  for  which  every  such  vector  sum  equals  zero  is  both 
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necessary  and  sufficient  for  A  to  be  a  subgradient  decomposition. 

This  equivalence  is  stated  as  a  theorem,  after  a  formal 

definition  of  an  elementary  circuit  is  given. 

2 

A  subset  7CA  is  an  elementary  circuit  of  the  polyhedral 
graph  A  if  there  is  a  subset  DCA  such  that 

(24)  7  c  {(j,k)|jfD  and  k«D  and  6^  and6k  share  an  edge), 

(25)  Vj*D  3 ! k  (j  , k) € 7 , 

(26)  Vke D  3! j  (j  ,k)«7,  and 

(27)  No  nonempty  7*07  satisfies  (24),  (25)  and  (26)  with  D 

replaced  by  any  D'CA. 

Define  I"  to  be  the  set  of  elementary  circuits  of  A. 

Theorem  3:  If  A  is  a  polyhedral  graph,  then  A  is  a  subgradient 

decomposition  if  and  only  if  there  exists  a  polyhedral  directed 
graph  (A,E)  such  that  the  sum  of  directed  edges  along  every 
elementary  circuit  is  zero,  i.  e., 

<28>  v-"r  S(j.k><-r‘jk-0' 

Outline  of  the  proof:  Suppose  that  A  is  a  subgradient 

decomposition,  satisfying  (10)  with  respect  to  the  convex 
function  V.  Then  E  can  be  defined  by 

(29)  cJk-7V(pk)-7V(pj) , 

i  i  k  k  1  k 

where  pJ«5J  and  n  eS  ,  whenever  SJ  and  5  share  an  edge. 

Conversely,  suppose  that  (28)  holds.  A  convex  function  V 

satisfying  (10)  will  be  constructed.  At  stage  1,  define  V(^)-0 

for  fi(S^  and  7V(p)-0  for  ^eint(J^).  For  t>l ,  define  D  »{fi|  V  and 

7V  have  been  defined  for  S  by  stage  t-1).  Suppose  that  A\Dc><0. 

It  is  easily  shown  that  a  polyhedral  graph  is  connected,  which 


1  k 

implies  that  there  are  j  and  k  such  that  SJ  and  S  share  an  edge 

1  Ic  i  if  k 

& ,  £  D^,  and  S  fD^.  Let  (icw,  n'eint(5  ),  and  fi"eS  .  Then,  at 

k  .  k 

stage  t,  define  VV  on  int(S  )  by  (29)  and  define  V  on  S  by 

(30)  V(m")-V(m)+VV(m')(p"-/0  . 

By  (28),  this  recursive  definition  is  consistent.  The  proof  that 

V  is  convex  is  based  on  (14).  Q.E.D. 

The  motivation  for  Theorem  3  is  to  determine  whether  the 

convex  function  required  by  (10)  in  Theorem  1  exists.  By 

Theorems  2  and  3,  this  question  can  be  answered  by  finding 

whether  A- (  c  1  (w )  |  well }  is  a  polyhedral  graph,  and,  if  so,  whether 

a  polyhedral  directed  graph  (A,E)  exists  that  satisfies  (28).  A 

reduction  of  this  latter  question  to  a  system  of  linear  weak 

inequalities  is  now  obtained.  In  order  to  do  this,  first  form  a 

polyhedral  directed  graph  (A,F)  which  is  guaranteed  by  Lemma  8  to 

exist.  (For  instance,  F  may  be  constructed  by  taking  unit 

vectors  that  satisfy  (14)  to  be  the  directed  edges  .)  Then, 

by  Lemma  8,  (28)  can  be  satisfied  if  and  only  if  there  exist 

i  k  1  k 

positive  scalars  aJ  (defined  whenever  SJ  and  6  share  an  edge) 
such  that 

(31)  *7<r  Z(J  k)<ia*V,t-0. 

By  the  linearity  of  the  equations  in  (31),  the  existence  of 

i  k 

positive  solutions  is  equivalent  to  the  existence  of  aJ  that 
satisfy  both  (31)  and  also 

(32)  k> 1 . 

Thus,  (31)  and  (32)  define  a  system  of  linear  weak  inequalities, 
the  consistency  of  which  system  is  equivalent  to  A  being  a 
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subgradient  decomposition,  and  hence  equivalent  to  there  existing 
a  utility  function  that  rationalizes  the  partition  II  from  which  & 
is  derived. 

6 .  Cone lus ion 

Standard  axiomatizations  of  expec te d - ut i 1 i ty  theory  envision 
an  agent  with  fixed  probability  assessments,  who  can  be  observed 
to  choose  actions  from  varying  opportunity  sets  (for  instance, 
pairs  of  lotteries)  These  axiomatizations  also  envision  that 
the  agent's  preferences  among  these  actions  depend  on  the  state 
of  nature  only  through  the  state  *  dependent  consequences  of  the 
actions,  and  that  these  consequences  are  clearly  defined  and 
observable.  In  this  paper,  we  have  suggested  that  this 
conception  of  the  observable  consequences  of  the  theory  may  be  an 
unsatisfactory  basis  for  empirical  testing  of  the  theory. 
Rather,  there  may  be  many  opportunities  to  observe  choices  from  a 
fixed  set  of  actions  as  probability  assessments  change.  We  have 
also  suggested  that  the  specification  of  utilities  in  terms  of 
consequences  may  be  quite  difficult  to  implement  empirically.  On 
this  view,  a  less  restrictive  specification  of  state  -  dependent 
utility  needs  to  be  studied. 

It  might  be  asked  whether  this  new  conception  of  observable 
consequences  of  expec ted - ut i 1 i ty  maximization  places  any  testable 
restrictions  on  observation.  The  results  of  this  paper  show  that 
there  will  be  testable  restrictions  if  the  decision  maker's 
probability  assessments  can  be  observed  or  estimated.  One  of 
these  restrictions  is  that  each  action  must  be  chosen  for  all 
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probability  measures  in  the  interior  of  some  convex  subset  of  the 
probability  simplex.  The  example  constructed  in  section  5  (and 
depicted  in  Figure  1)  shows  that  this  convexity  restriction  does 
not  exhaust  the  empirical  content  of  the  theory.  Theorem  3  fully 
characterizes  this  empirical  content  in  terms  of  a  discrete 
version  of  an  integrabi lity  condition,  familiar  from  the  theory 
of  choice  under  certainty. 

There  are  at  least  two  ways  in  which  the  decision  maker's 
probability  assessments  might  be  treated  as  observable  or 
estimable.  One  would  be  to  elicit  them  directly,  perhaps  using  a 
"proper  scoring  rule"  that  would  provide  incentives  for  truthful 
revelation.  The  other  would  be  to  impose  a  rational- 
expectations  assumption  that  the  decision  maker  knows  the  true 
statistical  distribution  from  which  observations  are  drawn. 
Consider,  for  instance,  the  example  of  medical  diagnosis  that  was 
discussed  in  the  introduction.  The  medical  history  and 
examination  results  that  constitute  the  doctor's  evidence  about  a 
patient  can  be  viewed  as  a  dis ere te - valued  measurement.  This 
measurement  partitions  the  set  of  patients  into  finitely  many 
equivalence  classes.  That  is,  all  patients  within  a  class  have 
identical  medical  histories  and  examination  results.  Within  each 
equivalence  class,  there  is  a  sample  distribution  of  eventual, 
accurate  diagnoses . ^  These  diagnosis  outcomes  can  be  treated  as 

^Edwards  (  19  72,  pp  .  139-140)  quoted  by  Shortliffe  and 
Buchanan  (1984,  p.236)  has  described  some  of  the  practical 
difficulties  of  gathering  and  interpreting  such  a  data  set. 
Nevertheless,  particularly  because  the  use  of  computers  has 
dramatically  improved  the  quality  of  medical  data  available,  the 
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an  independent  sample  drawn  from  the  doctor's  posterior 
probability  distribution  conditioned  on  the  evidence.  The 
likelihood  function  regarding  the  beliefs  of  the  doctor  can  then 
be  formed  in  a  standard  way.  This  rational - expec tations  approach 
to  estimation  has  two  particular  virtues  when  the  decision  maker 
is  an  expert  in  the  decision  domain  being  studied.  First,  such 
an  expert  will  have  substantial  experience,  and  standard  results 
of  decision  theory  imply  that  conditioning  on  such  experience 
should  closely  approximate  the  distribution  of  the  evidence. 
Second,  regardless  of  what  are  the  subjective  beliefs  of  the 
decision  maker,  whether  an  expert  optimizes  with  respect  to  the 
empirical  distribution  of  a  large  sample  of  decision  situations 
is  of  independent  interest.  This  objective  assessment,  rather 
than  the  expert's  own  subjective  assessment  of  success,  might 
even  be  what  the  expert's  clients  are  most  eager  to  know. 

Finally,  it  might  be  asked  whether  the  characterization  of 
expected-utility  maximization  provided  in  Theorem  3  can 
potentially  be  of  direct  use  in  the  kind  of  empirical  study  that 
is  suggested  here.  This  question  arises  because  any  data  set 
will  be  finite,  so  its  consistency  with  expected-utility  theory 
can  be  determined  directly  by  seeing  whether  finitely  many 
instances  of  inequality  (2)  are  a  consistent  system.  The  answer 
to  the  question  has  to  do  with  computational  complexity.  Let  a 
be  the  number  of  actions  available  to  the  decision  maker.  Then 
the  number  of  linear  inequalities  that  must  be  solved  to 

use  of  such  data  may  well  be  feasible. 


29 


implement  the  direct  approach  is  (o-l)  times  the  cardinality  of 
the  data  partition  that  was  defined  in  the  preceding  paragraph. 
Theorem  3  concerns  a  polyhedral  graph  that  has  at  most  a(a-l)/2 
edges,  and  the  number  of  linear  inequalities  that  have  to  be 
solved  is  roughly  equal  to  the  number  of  states  of  nature  times 
the  number  of  elementary  cycles  of  this  graph.  If  the  sets  of 
states  of  nature  and  of  actions  are  small,  but  the  data  partition 
is  fine,  it  might  be  possible  to  apply  the  geometric  insights  of 
Theorem  3  to  minimize  the  amount  of  computation  that  is  required. 
In  particular,  only  comparisons  between  pairs  of  actions 
corresponding  to  adjacent  vertexes  of  the  polyhedral  graph,  and 
comparisons  made  at  partition  elements  having  sample 
distributions  close  to  the  boundaries  of  those  vertexes,  may  be 
necessary.  If  so,  then  Theorem  3  may  have  considerable  practical 
importance . 
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